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MINIMAL INSTRUMENTATION FOR LOSSLESS CALL PROFILING 
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Technical Field of the Invention 

The present invention pertains generally to development of computer software 
applications, and more particularly to performance analysis of such applications. 



Ci 15 Background of the Invention 

"m Computer software appUcations typically consist of many modules, known as 

'"^""^ fiinctions, which work together to perform desired tasks. A significant concern in 

C software development is operational speed, which is largely dependent on the 

performance of the individual fimctions making up the apphcation. Thus, an important 
£^ 20 component of application development is monitoring application performance, 
p To provide usefixl information, performance measurement should, to the extent 

possible, avoid interfering with the normal operation of the application. Because 
performance measurement involves collecting data in circumstances under which data 
would not ordinarily be collected, performance measurement necessarily affects the 
25 normal operation of the application to a certain extent. Certain factors contribute to this 
effect. For example, inserting data collection code in an application makes the 
application larger, which slows operation down somewhat. This decrease in performance 
is attributable to the increase in the amount of code that must be both loaded into the 
system's memory and executed. In addition, adding data collection code increases the 
30 amoimt of time other operations take due to the execution of the data collection code. As 
a result, the time relationships between the other activities within the application and 
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within the system are affected. Data collection also affects application performance as a 
result of input/output (I/O) bandwidth considerations. Specifically, when data is 
collected, it must be stored, for example, in physical memory, on a disk, or on a 
networked device. Storing data consumes significant time due to the relatively slow 
5 process of outputting data to the storage device. This burden is especially considerable 
when large amounts of data are collected. While the burden can be reduced by adding a 
dedicated I/O device, such as a disk drive, device contention may still arise at the system 
level. 

10 Summary of the Invention 

According to various example implementations of the invention, there is provided 
a process for collecting information relating to the execution of an application for use in 
performance analysis. Probes are inserted at only those points in the application that 
would yield non-redundant information. By avoiding the collection of redundant 

15 information, the effect of the information collection process on the application is reduced. 
Additionally, performance of the data analysis process is improved due to the reduction 
of cases in which it must be decided whether data is redundant, and also due to the 
reduction of work involved in discarding redundant data. The accuracy and usefulness of 
the collected information are thus improved. 

20 In a particular implementation, probe locations are determined at which collecting 

data relating to the execution of the apphcation would produce non-redimdant 
information. Probes are then inserted only at these locations. In general, these locations 
include the entry and exit points of the functions making up the application. Other 
particular implementations of the present invention relate to, for example, exceptional 

25 cases in which probes are inserted at other locations in addition to entry and exit points. 
In another particular implementation, the entry and exit points of functions 
constituting at least a portion of the application are determined. Also determined are 
probe locations in the application at which collecting data relating to the execution of the 
application would produce non-redundant information. These probe locations include at 

30 least the determined entry and exit points. Probes are inserted only at the determined 
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probe locations in the application. These probes are then used to collect the information 
relating to the execution of the apphcation. The collected information is then analyzed. 

Still other implementations include computer-readable media and apparatuses for 
performing these methods. The above summary of the present invention is not intended 
5 to describe every implementation of the present invention. The figures and the detailed 
description that follow more particularly exemplify these implementations. 

Brief Description of the Drawings 

Figure 1 illustrates a simplified overview of an example embodiment of a 

computing environment for the present invention. 
Figure 2 is a flow chart illustrating an example method for analyzing the 

performance of an application, according to an embodiment of the 
present invention. 

Figure 3 depicts a portion of an application with potential probe locations. 
Figure 4 depicts an example instrumentation scheme according to one 

operational example of an embodiment of the present invention. 
Figure 5 depicts another example instrumentation scheme according to 
another operational example of an embodiment of the present 
invention. 

Detailed Description of the Invention 

In the following detailed description of the preferred embodiments, reference is 
made to the accompanying drawings that form a part hereof, and in which are shown by 
way of illustration specific embodiments in which the invention may be practiced. It is 
understood that other embodiments may be utilized and structural changes may be made 
without departing from the scope of the present invention. 

Hardware and Operating Environment 
Figure 1 illustrates a hardware and operating environment in conjunction with 
30 which embodiments of the invention may be practiced. The description of Figure 1 is 
intended to provide a brief, general description of suitable computer hardware and a 
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suitable computing environment with which the invention may be implemented. 
Although not required, the invention is described in the general context of computer- 
executable instructions, such as program modules, being executed by a computer, such as 
a personal computer (PC). This is one embodiment of many different computer 

5 configurations, some including speciahzed hardware circuits to analyze performance, that 
may be used to implement the present invention. Generally, program modules include 
routines, programs, objects, components, data structures, etc. that perform particular tasks 
or implement particular abstract data types. 

Moreover, those skilled in the art will appreciate that the invention may be 

10 practiced with other computer-system configurations, including hand-held devices, 
multiprocessor systems, microprocessor-based or programmable consumer electronics, 
network personal computers ("PCs"), minicomputers, mainjframe computers, and the like. 
The invention may also be practiced in distributed computing environments where tasks 
are performed by remote processing devices linked through a communications network. 

15 In a distributed computing environment, program modules may be located in both local 
and remote memory storage devices. 

Figure 1 shows a computer arrangement implemented as a general-purpose 
computing or information-handling system 80. This embodiment includes a general 
purpose computing device such as personal computer (PC) 120, that includes processing 

20 unit 121, a system memory 122, and a system bus 123 that operatively couples the system 
memory 122 and other system components to processing unit 121. There may be only 
one or there may be more than one processing unit 121, such that the processor computer 
120 comprises a single central-processing unit (CPU), or a plurality of processing units, 
commonly referred to as a parallel processing environment. The computer 120 may be a 

25 conventional computer, a distributed computer, or any other type of computer; the 
invention is not so limited. 

In other embodiments other configurations are used in PC 120. System bus 123 
may be any of several types, including a memory bus or memory controller, a peripheral 
bus, and a local bus, and may use any of a variety of bus architectures. The system 

30 memory 122 may also be referred to as simply the memory, and it includes read-only 
memory (ROM) 124 and random-access memory (RAM) 125. A basic input/output 



4 



V 



system (BIOS) 126, stored in ROM 124, contains the basic routines that transfer 
information between components of personal computer 120. BIOS 126 also contains 
start-up routines for the system. 

Personal computer 120 further includes hard disk drive 127 having one or more 
5 magnetic hard disks (not shown) onto which data is stored and retrieved for reading from 
and writing to hard-disk-drive interface 132, magnetic disk drive 128 for reading from 
and writing to a removable magnetic disk 129, and optical disk drive 130 for reading 
from and/or writing to a removable optical disk 131 such as a CD-ROM, DVD or other 
optical medium. Hard disk drive 127, magnetic disk drive 128, and optical disk drive 130 
10 are connected to system bus 123 by a hard-disk drive interface 132, a magnetic-disk drive 
interface 133, and an optical-drive interface 134, respectively. The drives 127, 128, and 
130 and their associated computer-readable media 129, 131 provide nonvolatile storage 
% of computer-readable instructions, data structures, program modules and other data for 

^ personal computer 120, Although the exemplary environment described herein employs 

S 15 a hard disk, a removable magnetic disk 129 and a removable optical disk 131, those 
i| skilled in the art will appreciate that other types of computer-readable media which can 

store data accessible by a computer may also be used in the exemplary operating 
p environment. Such media may include magnetic tape cassettes, flash-memory cards, 

.J digital video disks (DVD), BemouUi cartridges, RAMs, ROMs, and the like. 

7"; 20 In various embodiments, program modules are stored on the hard disk drive 127, 

St magnetic disk 129, optical disk 131, ROM 124 and/or RAM 125 and may be moved 

among these devices, e,g., from hard disk drive 127 to RAM 125. Program modules 
include operating system 135, one or more apphcation programs 136, other program 
modules 137, and/or program data 138. A user may enter commands and information 
25 into personal computer 120 through input devices such as a keyboard 140 and a pointing 
device 142. Other input devices (not shown) for various embodiments include one or 
more devices selected from a microphone, joystick, game pad, satellite dish, scanner, or 
the like. These and other input devices are often connected to the processing unit 121 
through a serial-port interface 146 coupled to system bus 123, but in other embodiments 
30 they are connected through other interfaces not shown in Figure 1, such as a parallel port, 
a game port, or a universal serial bus (USB) interface. A monitor 147 or other display 
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device also connects to system bus 123 via an interface such as a video adapter 148. In 
some embodiments, one or more speakers 157 or other audio output transducers are 
driven by sound adapter 156 connected to system bus 123. In some embodiments, in 
addition to the monitor 147, system 180 includes other peripheral output devices (not 

5 shown) such as a printer or the like. 

In some embodiments, personal computer 120 operates in a networked 
environment using logical connections to one or more remote computers such as remote 
computer 149, Remote computer 149 may be another personal computer, a server, a 
router, a network PC, a peer device, or other common network node. Remote computer 

10 149 typically includes many or all of the components described above in connection with 
personal computer 120; however, only a storage device 150 is illustrated in Figure 1. The 
logical connections depicted in Figure 1 include local- area network (LAN) 151 and a 
wide-area network (WAN) 152, both of which are shown connecting PC 120 to remote 
computer 149; typical embodiments would only include one or the other. Such 

15 networking environments are commonplace in offices, enterprise-wide computer 
networks, Intranets and the Internet. 

When placed in a LAN networking environment, PC 120 connects to local 
network 151 through a network interface or adapter 153. When used in a WAN 
networking environment such as the Internet, PC 120 typically includes modem 154 or 

20 other means for establishing communications over network 152. Modem 154 may be 
internal or extemal to PC 120 and connects to system bus 123 via serial-port interface 
146 in the embodiment shown. In a networked environment, program modules depicted 
as residing within PC 120 or portions thereof may be stored in remote-storage device 
150. Of course, the network connections shown are illustrative, and other means of 

25 estabUshing a communications link between the computers may be substituted. 

Software may be designed using many different methods, including object- 
oriented programming methods, C++ and Java are two examples of common object- 
oriented computer programming languages that provide functionality associated with 
object-oriented programming. Object-oriented programming methods provide a means to 

30 encapsulate data members (variables) and member functions (methods) that operate on 
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that data into a single entity called a class. Object-oriented programming methods also 
provide a means to create new classes based on existing classes. 

An object is an instance of a class. The data members of an object are attributes 
that are stored inside the computer memory, and the methods are executable computer 
code that act upon this data, along with potentially providing other services. The notion 
of an object is exploited in the present invention in that certain aspects of the invention 
are implemented as objects in some embodiments. 

An interface is a group of related functions that are organized into a named unit. 
Some identifier may uniquely identify each interface. Interfaces have no instantiation; 
that is, an interface is a definition only without the executable code needed to implement 
the methods that are specified by the interface. An object may support an interface by 
providing executable code for the methods specified by the interface. The executable 
code supphed by the object must comply with the definitions specified by the mterface. 
The object may also provide additional methods. Those skilled in the art will recognize 
that interfaces are not limited to use in or by an object-oriented programming 
environment. 

Example Embodiments of the Invention 
According to one example embodiment of the present invention, probes are 
intelligently inserted in applications to collect enough information to perform fiiU call 
attribution, i.e., to measure the amount of time any fimction takes firom start to finish, 
while avoiding collecting redundant information that is not needed to perform fiiU call 
attribution. By avoiding the collection of unnecessary information, the various 
implementations of the present invention reduce the effect of the instrumentation process 
on the application being instrumented. As a result, normal operations are disrupted less 
significantly, allowing for greater accuracy in measuring application performance. 
Instrumentation can be performed at compilation time, as a post link step, or even while 
the application is running. Furthermore, the techniques described herein are language- 
independent and can be used with all procedural languages, e.g., C, C++, Java, Pascal, 
Fortran, BASIC, Cobol, etc. 

In particular, the apphcation whose performance is to be measured is 
instrumented at logical points that can be associated with the original source code that 



was used to generate the application, while maintaining a granularity that does not 
generate data that is not needed to perform Ml call attribution. In most cases, probes are 
inserted at the entry and exit points of all discrete functions in the application. Under 
certain circumstances, however, probes are also placed at other strategic points in the 

5 apphcation. InteUigent placement of these probes ensures that enough information is 
collected to perform foil call attribution, while avoiding unduly interfering with the 
normal operation of the application by avoiding the collection of redundant information. 

According to a particular implementation of the present invention, various types 
of probes are used to monitor the performance of an application. One type labeled, for 

10 example, _EnterFunction, is placed at the entry point of a function and collects the 
address of the function being entered and the current stack pointer, as well as a time 
stamp that indicates when the fonction was entered. Another type of probe, an 
_ExitFunction probe, is placed at an exit point of a function and collects data similar to 
that collected by the _EnterFunction probe. In particular, the _ExitFunction probe 

15 collects the address of the fonction being exited, the current stack pointer, and a time 
stamp that indicates when the function was exited. Yet another type of probe, a 
__CallToFunction probe, is placed at certain points in the apphcation inraiediately before 
program flow changes from one function to another. This type of probe collects the 
address of the fonction from which the call originated, the address of the function being 

20 called, the current stack pointer, and a tune stamp indicating when the change in program 
flow occurred. A fourth type of probe, a _RetumFromFunction probe, is placed at the 
retum point of a program flow change. The _RetumFromF\mction probe collects the 
address of the function to which program flow returns, the current stack pointer, and a 
time stamp indicating when program flow returned to the previous function. 

25 For each of these types of probes, the address of fonctions is collected so that, 

when performance is analyzed, time can be accurately attributed to the correct functions. 
The probes also collect the current stack pointer to facilitate correct attribution of 
recursive function calls. That is, collecting the current stack pointer allows the correct 
attribution of time to the correct instance of a function in recursive cases. In addition, 

30 attribution is facilitated in certain cases in which program flow is changed in non- 
traditional ways. 
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Referring again to the drawings. Figure 2 depicts an example method 200 for 
analyzing application performance, according to a particular embodiment of the present 
invention. At a block 202, the functions are analyzed to determine their entry and exit 
points. In particular, a function's entry point is the first statement in the function. Its exit 
5 points typically include the last statement in the function, as well as any points within the 
function at which other functions are called. 

Each of these entry and exit points is a potential location at which a probe might 
be inserted to collect data relevant to fiill call attribution. While full call attribution can 
be performed by inserting probes at all of these locations, it is usually not necessary to 
10 collect data at all of these points. Indeed, omitting instrumentation at certain non- 
essential points reduces the effect of the performance analysis process on the application 
whose performance is being analyzed, thereby improving accuracy. Accordingly, at a 
i3 block 204, the entry and exit points of the function are analyzed to determine which 

points, if instrumented with a probe, would yield non-redundant information, l e. , which 
15 points should be instrumented. 
:ji Figure 3 depicts a portion 300 of an application and its potential probe locations 

^ 302, 304, 306, 308, 310, and 312. For purposes of clarity, the potential probe locations 

O are depicted in italic type in Figure 3. As can be seen in Figure 3, potential locations for 

fO the _EnterFunction probes 302 and 310 occur at the beginnings of a main function 3 14 

;i; 20 and of a CalledToFunction function 316. Similarly, potential locations for the 
C _ExitFunction probes 308 and 3 12 occur at the ends of the main fimction 314 and of the 

CalledToFunction function 316. 

Some of the probes can be omitted so as to collect sufficient information to 
perform full call attribution without collecting redundant information. At the block 204 
25 of Figure 2, the system determines which probes can be omitted. In the specific example 
shown in Figure 3, the CallToFunction and RetumFromFunction probes provide 
redundant information. These probes are redundant because they would supply almost 
exactly the same information at almost the same time as other probes. For example, the 
^CallToFunction probe 304 in the main function 314 would supply almost the same 
30 information as the _EnterFunction probe 3 10 in the function 3 16 at nearly the same time, 
with only a shght time difference attributable to overhead for the probe itself Similarly, 
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the _RetumFromFunction probe 306 in the main function 314 would supply ahnost the 
same information as the _ExitFunction probe 3 12 in the function 3 16 at nearly the same 
time. Thus, because the _CallToFunction and __RetumFromFunction probes are 
redundant in this example, the potential probe locations 304 and 306 need not be 
5 instrumented, and are eliminated from consideration by the system at the block 204 of 
Figure 2. It is desirable to eliminate the _CallToFunction and _RetumFromFunction 
probes because there are typically more locations that call to functions than there are 
functions themselves. Thus, eliminating these probe pairs removes a larger portion of the 
total instrumentation, while ensuring that enough information to perform full call 
10 attribution is collected. 

While this strategy increases efficiency in most cases, it does not work for all 
cases. In particular, eliminating these probe pairs only works in the case of direct calls to 
^J"^ functions within the current module. Thus, while instrumentation of calls within the 

I' ; current module can be eliminated from the list of instrumentation points, instrumentation 

=^ 15 of calls outside of the current module cannot. Eliminating these probes would cause 

insufficient information for full call attribution to be gathered, since there is no guarantee 
C that other modules will be instrumented. Time can be incorrectly attributed as a result. 

For example, referring again to Figure 3, if the function 3 16 were located in a different 
+: module from the main function 314, eliminating the probes at locations 304 and 306 

p 20 would cause time spent in the function 3 1 6 to be incorrectly attributed to the main 

function 314. Thus, as a general rule, calls outside the current module are instrumented 
both before and after the call is performed, e.g. , if the main function 314 and the function 
316 were located in different modules, CallToFunction and RetumFromFunction 
probes would be inserted at locations 304 and 306, respectively. 
25 Another situation in which the normal strategy of instrumenting only the entry 

and exit points of functions produces potentially inaccurate results occurs in what is 
known as a tail merge. A tail merge is an optimization made either by a compiler or by 
the programmer at the machine instruction level that eliminates calls to functions by 
instead jumping to the next function, then having a single retmn statement return out of 
30 multiple functions. Figure 4 depicts an example of a tail merge situation. In Figure 4, a 
main function 402 calls a Functionl function 404. At a point 406, the function 404 in 
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turn calls a Fimction2 function 408. At the end of this function, a return statement 410 is 
encountered. This return statement 410, however, returns to the main function 402. This 
change in flow is contrary to the expected sequence of events, in which the return 
statement 410 would return to the fionction 404, which called the fimction 408. 
Normally, another return statement would redirect execution from the fimction 404 back 
to the main fimction 402, which called the fimction 404. Because a tail merge fimctions 
similarly to a call/return pair, it should be instrumented. A _CallToFunction probe 412 is 
inserted before the jump at the point 406, treating the jump as a call to the function 408. 
Since execution does not return to the point 406 after the jump, however, a 
RetumFromFunction probe placed after the jump would be superfluous. 

This departure from the normal sequence of events is a potential source of 
ambiguity in attributing time to the correct fimctions. Specifically, an issue arises as to 
the proper attribution of a time period 414 occurring after the point 406 and before the 
end of the main fimction 402. Typically, an _ExitFunction probe would precede the 
return statement 410 and would indicate that program flow is exiting the function 408. 
This probe, however, does not convey any information as to where program flow 
continues after exiting the function 408. Without additional information, this time is 
likely to be erroneously attributed to the function 404 because the function 408 was 
called by the function 404, but no end point was available within the function 404 with 
which to properly attribute time. The time period 414, however, should be attributed to 
the main fimction 402, which is where execution proceeds after the jump. This ambiguity 
can be resolved in a number of ways. One way is to flag the _CallToFunction probe 412 
as calling a fimction that has a tail merge. Another way is to insert a 
RetumFromFunction probe 416 at the point to which execution proceeds after the tail 
merge. Inserting the _RetumFroniFunction probe 416 at this location clarifies the 
destination of the return from the fimction 408 when the fimction frace is processed. 

Another scenario in which inserting probes at only the entry and exit points of 
fimctions does not provide sufficient information to perform fiiU call attribution arises in 
connection with blocks of code known as catch blocks and in connection with non-local 
go-to's. Both of these situations arise most commonly in connection with error handling 
mechanisms and can be instrumented in the same manner. Figure 5 illustrates a catch 
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block and its instrumentation. In the example depicted in Figure 5, a block of code 
attached to a catch statement 502, known as a catch block 502, is not called. Rather, 
execution jumps to the catch block 502 in the case of an exception throw 506 in a try 
block 508. Execution of the catch block 502 is neither a function call nor a tail merge. 
5 The catch block 502, however, can be treated as a function for purposes of performance 
analysis. In one implementation, the catch block 502 is treated as a reentrant call to the 
function that contains the catch block 502, i.e., a main function 510 in this case. Thus, in 
this example, an _EnterFunction probe 512 and an _ExitFunction probe 514 are inserted 
in the catch block 502 at the locations illustrated in Figure 5, with the function being 

10 entered and exited denoted as the main function 510. 

Some other error handling mechanisms produce non-local go-to' s by causing 
execution to jump to other blocks of code. For example, in the C programming language, 
the setjmp/longjmp facilities produce non-local go-to's. These blocks of code can be 
instrumented in the same manner as catch blocks, as discussed above in connection with 

15 Figure 5. 

Another situation that is noteworthy is that raised by inlined functions, which are 
functions marked as "inlined" by the developer as a directive to the compiler. A function 
is requested to be inlined when the developer surmises that the cost (in terms of the code 
generated) associated with calling, executing, and retuming from the function is greater 

20 than the cost associated with simply executing the code "inline" within the calling 

function. The compiler may or may not fulfill this request. If the request is fulfilled, the 
inlined function will become part of another function and will not have entry and exit 
points associated with it. Thus, the inlined function is not instrumented. If, on the other 
hand, the request is not fulfilled, then the function that was intended to be inlined will 

25 still have entry and exit points, which are instrumented with _EnterFunction and 

_ExitFunction probes, as appropriate. These probes are useful in that the developer does 
not expect the function to exist as a distinct entity. If time is attributed to a function that 
the developer expects to be inlined, then the developer can investigate the circimistances 
precluding the inlining of the function and can make changes to enable the function to be 

30 inlined. 
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Referring again to Figure 2, after the non-redundant probe locations are 
determined in connection with the block 204, execution proceeds to a block 206, at which 
the appropriate probes are inserted in the apphcation at only the non-redundant probe 
locations. Later, when the application is executed, the probes are used at a block 208 to 

5 collect information that includes, but is not necessarily limited to, information regarding 
execution of the probes' associated functions. In particular, this information may 
include, for example, the address of the function being entered or exited, the address from 
which another function is being called, and/or the address of a function being called. The 
collected information may also include a stack pointer to facilitate tracing of previous 

10 system activity, as well as a time stamp indicating when a function was entered or exited, 
as discussed above. After this information is collected, it is analyzed at a block 210 to 
determine how time should be attributed among the various functions making up the 
apphcation. 

By eliminatmg tiiose probes that would produce redundant information, the 
1 5 various implementations of the present invention reduce the amount of data collection 
activity performed while analyzing the performance of an application, thereby reducmg 
the extent to which the analyzed application's operation is distorted. As a result, the 
accuracy of the profiling process is improved. 

While the embodiments of the invention have been described with specific focus 
20 on their embodiment in a software implementation, the invention as described above is 
not limited to software embodiments. For example, the invention may be implemented in 
whole or in part in hardware, firmware, software, or any combination thereof. The 
software of the mvention may be embodied in various forms such as a computer program 
encoded in a machine-readable medium, such as a CD-ROM, magnetic medium, ROM or 
25 RAM, or in an electronic signal. Further, as used in the claims herein, the term "module" 
shall mean any hardware or software component, or any combination thereof 
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A computer-implemented method for collecting information relating to 
5 exe^dtion of an application, the method comprising: 

determining a set of probe locations in the appUcation at which collecting data 
relating to the execution of the appUcation would produce non-redundant information; 
and 

inserting probes only at the determined probe locations in the application. 

10 

2. The method of claim 1, further comprising determining entry and exit 
points of a plurality of functions constituting at least a portion of the appUcation. 

3. The method of claim 2, further comprising identifying the entry and exit 
15 points as probe locations at which probes are to be inserted. 

4. The method of claim 1, further comprising: 

identifying a first location within the appUcation at which a function call directs 
execution of the appUcation to a second location outside of a current module; and 
20 inserting a first probe before the identified first location and a second probe after 

the identified first location. 

5. The method of claim 4, wherein the first probe is configured to collect an 
address of a first function in which the identified first location is located, an address of a 

25 second function in which the second location is located, a first stack pointer, and a first 
time indicator, and the second probe is configured to collect the address of the second 
function, a second stack pointer, and a second time indicator. 

6. The method of claim 1, further comprising: 
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identifying a first location within a calling function at which execution of the 
apphcation is directed to a called function having an exit point at which execution of the 
application is directed to a second location outside of the calling function; and 

inserting a first probe before the first location and a second probe after the second 
5 location. 

7. The method of claim 6, wherein the first probe is configured to collect an 
address of the calling function, an address of the called function, a first stack pointer, and 
a first time indicator, and the second probe is configured to collect the address of the 

10 called function, a second stack pointer, and a second time indicator. 

8. The method of claim 1, further comprising: 

identifying a block of code to which execution of the application is directed upon 
occurrence of an error; and 



probe at an end of the identified block of code. 

9. The method of claim 8, wherein the first probe is configured to collect an 
address of the block of code, a first stack pointer, and a first time indicator, and the 

20 second probe is configured to collect the address of the block of code, a second stack 
pointer, and a second time indicator. 

10. The method of claim 1 , further comprising using the inserted probes to 
collect the information relating to the execution of the application. 
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inserting a first probe at a beginning of the identified block of code and a second 



25 



11. 



The method of claim 10, further comprising analyzing the collected 



information, 
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determining entry and exit points of a plurality of functions constituting at least a 
portion of the application; 

determining a set of probe locations in the application at which collecting data 
relating to the execution of the application would produce non-redundant information, the 
5 set of probe locations including at least the entry and exit points of the functions; 

inserting probes only at the determined probe locations in the application; 

using the inserted probes to collect the information relating to the execution of the 
apphcation; and 

analyzing the collected information. 

10 

1 3 . The method of claim 1 2, further comprising: 

identifying a first location within the application at which a function call directs 
"5 execution of the application to a second location outside of a current module; and 

^} inserting a first probe before the identified first location and a second probe after 

y ^ 

Q 15 the identified first location, 

P wherein the first probe is configured to collect an address of a first function in 

which the identified first location is located, an address of a second function in which the 

p second location is located, a first stack pointer, and a first time indicator, and the second 

probe is configured to collect the address of the second function, a second stack pointer, 

ffl 20 and a second time indicator. 

14. The method of claim 12, further comprising: 

identifying a first location within a calling function at which execution of the 
application is directed to a called function having an exit point at which execution of the 
25 apphcation is directed to a second location outside of the calling function; and 

inserting a first probe before the first location and a second probe after the second 
location, 

wherein the first probe is configured to collect an address of the calling function, 
an address of the called function, a first stack pointer, and a first time indicator, and the 
30 second probe is configured to collect the address of the called function, a second stack 
pointer, and a second time indicator. 
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15. The method of claim 12, further comprising: 

identifying a block of code to which execution of the appHcation is directed upon 
occurrence of an error; and 

inserting a first probe at a beginning of the identified block of code and a second 
probe at an end of the identified block of code, 

wherein the first probe is configured to collect an address of the block of code, a 
first stack pointer, and a first time indicator, and the second probe is configured to collect 
the address of the block of code, a second stack pointer, and a second time indicator. 



M A computer-readable medium having computer-executable instructions 



determining a set of probe locations in the application at which collecting data 
relating to the execution of the appKcation would produce non-redundant information; 
and 

inserting probes only at the determined probe locations in the application, 

17. The computer-readable medium of claim 16, having fiirther computer- 
executable instructions for determining entry and exit points of a plurality of functions 
constituting at least a portion of the application. 

18. The computer-readable medium of claim 1 7, having further computer- 
executable instructions for identifying the entry and exit points as probe locations at 
which probes are to be inserted. 

19. The computer-readable medium of claim 16, having further computer- 
executable instructions for: 

identifying a first location within the application at which a function call directs 
execution of the appUcation to a second location outside of a current module; and 

inserting a first probe before the identified first location and a second probe after 
the identified first location. 




20. The computer-readable medium of claim 19, wherein the first probe is 
configured to collect an address of a first function in which the identified first location is 
located, an address of a second function in which the second location is located, a first 
stack pointer, and a first time indicator, and the second probe is configured to collect the 
address of the second function, a second stack pointer, and a second time indicator. 

21. The computer-readable medium of claim 16, having further computer- 
executable instructions for: 

identifying a first location within a calling function at which execution of the 
appUcation is directed to a called function having an exit point at which execution of the 
application is directed to a second location outside of the calling function; and 

inserting a first probe before the first location and a second probe after the second 
location. 

22. The computer-readable medium of claim 21, wherein the first probe is 
configured to collect an address of the calling function, an address of the called function, 
a first stack pointer, and a first time indicator, and the second probe is configured to 
collect the address of the called function, a second stack pointer, and a second time 
indicator. 

23. The computer-readable medium of claim 16, having further computer- 
executable instructions for: 

identifying a block of code to which execution of the application is directed upon 
occurrence of an error; and 

inserting a first probe at a beginning of the identified block of code and a second 
probe at an end of the identified block of code. 

24. The computer-readable medium of claim 23, wherein the first probe is 
configured to collect an address of the block of code, a first stack pointer, and a first time 
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indicator, and the second probe is configured to collect the address of the block of code, a 
second stack pointer, and a second time indicator. 



25. The computer-readable medium of claim 16, having further computer- 

5 executable instructions for using the inserted probes to collect the information relating to 
the execution of the apphcation. 

26. The computer-readable medium of claim 25, having further computer- 
executable instructions for analyzing the collected information. 

10 

/A computer-readable medium having computer-executable instructions 

determining entry and exit points of a plurality of functions constituting at least a 
%l portion of the application; 

rJ 15 determining a set of probe locations in the application at which collecting data 

relating to the execution of the application would produce non-redundant information, the 
set of probe locations including at least the entry and exit points of the functions; 
inserting probes only at the determined probe locations in the application; 
J^f^ using the inserted probes to collect the information relating to the execution of the 

^ 20 application; and 

analyzing the collected information. 



28. The computer-readable medium of claim 27, having further computer- 
executable instructions for: 
25 identifying a first location within the application at which a function call directs 

execution of the application to a second location outside of a current module; and 

inserting a first probe before the identified first location and a second probe after 
the identified first location, 

wherein the first probe is configured to collect an address of a first function in 
30 which the identified first location is located, an address of a second function in which the 
second location is located, a first stack pointer, and a first time indicator, and the second 
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probe is configured to collect the address of the second function, a second stack pointer, 
and a second time indicator. 



29. The computer-readable medium of claim 27, having further computer- 
executable instructions for: 

identifying a first location within a calling function at which execution of the 
application is directed to a called function having an exit point at which execution of the 
application is directed to a second location outside of the calling function; and 

inserting a first probe before the first location and a second probe after the second 
location, 

wherein the first probe is configured to collect an address of the calling function, 
an address of the called fimction, a first stack pointer, and a first time indicator, and the 
second probe is configured to collect the address of the called fimction, a second stack 
pointer, and a second time indicator. 

30. The computer-readable medium of claim 27, having further computer- 
executable instructions for: 

identifying a block of code to which execution of the application is directed upon 

occurrence of an error; and 

inserting a first probe at a beginning of the identified block of code and a second 
probe at an end of the identified block of code, 

wherein the first probe is configured to collect an address of the block of code, a 
first stack pointer, and a first time indicator, and the second probe is configured to collect 
the address of the block of code, a second stack pointer, and a second time indicator. 



instructions for: 

determining a set of probe locations in the appHcation at which collecting data 
relating to the execution of the application would produce non-redundant information; 
and 

inserting probes only at the determined probe locations in the application. 




A computer arrangement configured to execute computer-executable 
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32. The computer arrangement of claim 3 1 , further configured to execute 
computer-executable instructions for determining entry and exit points of a plurality of 
functions constituting at least a portion of the application. 

5 

33. The computer arrangement of claim 32, further configured to execute 
computer-executable instructions for identifying the entry and exit points as probe 
locations at which probes are to be inserted. 

34. The computer arrangement of claim 3 1 , further configured to execute 
computer-executable instructions for: 

identifying a first location within the application at which a function call directs 
execution of the application to a second location outside of a current module; and 

inserting a first probe before the identified first location and a second probe after 
the identified first location. 

35. The computer arrangement of claim 34, wherein the first probe is 
configured to collect an address of a first function in which the identified first location is 
located, an address of a second function in which the second location is located, a first 
stack pointer, and a first time indicator, and the second probe is configured to collect the 
address of the second function, a second stack pointer, and a second time indicator. 

36. The computer arrangement of claim 31, further configured to execute 
computer-executable instructions for: 

25 identifying a first location within a calling function at which execution of the 

application is directed to a called function having an exit point at which execution of the 
application is directed to a second location outside of the calling function; and 

inserting a first probe before the first location and a second probe after the second 
location. 

30 
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37. The computer arrangement of claim 36, wherein the first probe is 
configured to collect an address of the calling function, an address of the called function, 
a first stack pointer, and a first time indicator, and the second probe is configured to 
collect the address of the called function, a second stack pointer, and a second time 
indicator. 

38. The computer arrangement of claim 31, further configured to execute 
computer-executable instructions for: 

identifying a block of code to which execution of the appUcation is directed upon 
occurrence of an error; and 

inserting a first probe at a beginning of the identified block of code and a second 
probe at an end of the identified block of code. 

39. The computer arrangement of claim 38, wherein the first probe is 
configured to collect an address of the block of code, a first stack pointer, and a first time 
indicator, and the second probe is configured to collect the address of the block of code, a 
second stack pointer, and a second time indicator. 

40. The computer arrangement of claim 3 1 , further configured to execute 
computer-executable instructions for using the inserted probes to collect the information 
relating to the execution of the application. 

41 . The computer arrangement of claim 40, further configured to execute 
computer-executable instructions for analyzing the collected information. 



A computer arrangement configured to execute computer-executable 
instructions for: 

determining entry and exit points of a plurality of functions constituting at least a 
portion of the application; 
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determining a set of probe locations in the application at which collecting data 
relating to the execution of the application would produce non-redundant information, the 
set of probe locations including at least the entry and exit points of the functions; 

inserting probes only at the determined probe locations in the application; 

using the inserted probes to collect the information relating to the execution of the 
application; and 

analyzing the collected information. 

43 . The computer arrangement of claim 42, further configured to execute 
computer-executable instructions for: 

identifying a first location within the appHcation at which a function call directs 
execution of the application to a second location outside of a current module; and 

inserting a first probe before the identified first location and a second probe after 
the identified first location, 

wherein the first probe is configured to collect an address of a first fimction in 
which the identified fu-st location is located, an address of a second function in which the 
second location is located, a first stack pointer, and a first time indicator, and the second 
probe is configured to collect the address of the second function, a second stack pointer, 
and a second time indicator. 

44. The computer arrangement of claim 42, further configured to execute 
computer-executable instructions for: 

identifying a fu-st location within a calling function at which execution of the 
application is directed to a called function having an exit point at which execution of the 
apphcation is directed to a second location outside of the calling function; and 

inserting a first probe before the first location and a second probe after the second 
location, 

wherein the first probe is configured to collect an address of the calling function, 
an address of the called fimction, a first stack pointer, and a first time indicator, and the 
second probe is configured to collect the address of the called fimction, a second stack 
pointer, and a second time indicator. 
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45. The computer arrangement of claim 42, further configured to execute 
computer-executable instructions for: 

identifying a block of code to which execution of the apphcation is directed upon 
occurrence of an error; and 

inserting a first probe at a beginning of the identified block of code and a second 
probe at an end of the identified block of code, 

wherein the furst probe is configured to collect an address of the block of code, a 
first stack pointer, and a first time indicator, and the second probe is configured to collect 
the address of the block of code, a second stack pointer, and a second time indicator. 
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ABSTRACT OF THE DISCLOSURE 



A method, apparatus, and software are disclosed for collecting information 
relating to the execution of an appHcation for use in analyzing the performance of the 
application. Probes are inserted at only those points in the apphcation that would yield 
5 non-redundant information. As a result, the impact of the information collection process 
on the application is reduced, improving the usefulness of the collected information. 
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void mainQ 
{ 



■302 



EnterFunctionQ; 



p304 

_CallToFunctionO; 
CalledToFunctionQ; r^"° 
ReturnFromFundionO; 



(-308 
_ExitFunction(); 



} 



void CalledToFunctionQ 
{ 
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EnterFunctionQ; 
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ExitFunctionO; 
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void mainQ 
{ 



call FunctionlQ; 
_ReturnFromFunction(); 



} 

void FunctionlQ 
{ 



} 



^412 

_CallToFunctionO; /^406 
jump to Function2(); 



void Function2() 
{ 
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— 402 



— 404 



r 
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-408 



return; 



// this returns to mainQ, not to FunctionlQ 
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void mairiO 
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_EnterFunctionQ; 

try 

{ 



^506 
throw(); 
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catch 
{ 
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_EnterFunctionO; 
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} 

_ExitFunction(); 



_ExitFunction(); 
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§1.56 Duty to disclose information material to patentability. 

(a) A patent by its very nature is affected with a pubhc interest. The public interest is best served, and the most effective patent 
examination occurs when, at the time an application is being examined, the Office is aware of and evaluates the teachings of all information 
material to patentability. Each individual associated with the filing and prosecution of a patent application has a duty of candor and good 
faith in dealing with the Office, which includes a duty to disclose to the Office all mformation known to that individual to be material to 
patentability as defined in this section. The duty to disclose information exists with respect to each pending claim until the claim is canceled 
or withdrawn from consideration, or the application becomes abandoned. Information material to the patentability of a claim that is 
canceled or withdrawn from consideration need not be submitted if the information is not material to the patentability of any claim 
remaining under consideration in the application. There is no duty to submit information which is not material to the patentability of any 
existing claim. The duty to disclose all information known to be material to patentability is deemed to be satisfied if all information known 
to be material to patentability of any claim issued in a patent was cited by the Office or submitted to the Office in the manner prescribed by 
§ § 1 .97(b)-(d) and 1 .98 . However, no patent will be granted on an application in connection with which fraud on the Office was practiced 
or attempted or the duty of disclosure was violated through bad faith or intentional misconduct. The Office encourages applicants to 
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(1) prior art cited in search reports of a foreign patent office in a coimterpart appHcation, and 

f :. (2) the closest information over which individuals associated with the filing or prosecution of a patent apphcation believe any 
pendmg claim patentabiy defines, to make sure that any material information contained therein is disclosed to the Office. 

i^) Under this section, information is material to patentability when it is not cumulative to information akeady of record or being 
made of record in the application, and 

=• (I) It estabhshes, by itself or in combination with other information, a prima facie case of unpatentability of a claim; or 

- (2) It refutes, or is inconsistent with, a position the applicant takes in: 

(i) Opposing an argument of unpatentability rehed on by the Office, or 

f_ (ii) Asserting an argument of patentability . 

A'prima facie case of unpatentability is established when the information compels a conclusion that a claim is unpatentable under the 
preponderance of evidence, burden-of-proof standard, giving each term in the claim its broadest reasonable construction consistent with the 
sjiefeification, and before any consideration is given to evidence which may be submitted in an attempt to estabhsh a contrary conclusion of 
patentability. 

(c) Individuals associated with the filing or prosecution of a patent application within the meaning of this section are: 

(1) Each inventor named in the application: 

(2) Each attorney or agent who prepares or prosecutes the application; and 

(3) Every other person who is substantively involved m the preparation or prosecution of the apphcation and who is 
associated with the inventor, with the assignee or with anyone to whom there is an obHgation to assign the application. 

(d) Individuals other than the attorney, agent or inventor may comply with this section by disclosing mformation to the attorney, 
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